MS171138.2 

Claims 

What is claimed is: 

1 . A system for personalizing an information classifier, comprising: 

a first classifier, pre-trained with training data, operable to produce a first 
measure associated with a message classification; 

a second classifier, trained with adapting data, operable to produce a 
second measure associated with the message classification; and 

a combining component adapted to combine the first measure and the 
second measure to produce a third measure associated with the message 
classification. 

2. The system of claim 1 , further comprising: 

a weighting component adapted to assign a first weight to the first measure 
and a second weight to the second measure; and 

the combining component further adapted to combine the first measure 
and the second measure to produce the third measure associated with the message 
classification, based, at least in part, on the first measure, the second measure, the 
first weight and the second weight. 

3. The system of claim 2, further comprising 

an aging component adapted to modify the relevance of one or more 
messages based, at least in part, on time-based information associated with a 
message; and 

an adapting component operable to modify the second classifier. 

4. The system of claim 3, further comprising a user interface, operable to 
display information concerning the personalization of the second classifier. 
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5. The system of claim 4, where the information concerning the 
personalization of the second classifier is displayed as a graph. 

6. The system of claim 5, the user interface further operable to accept 
information concerning personalizing the second classifier. 

7. The system of claim 6, where the information concerning personalizing 
the second classifier comprises information related to at least one of the amount 
of adapting data required before a confidence level is associated with the 
personalized classifier and the coverage of adapting data required before a 
confidence level is associated with the personalized classifier. 

8. The system of claim 7 where the information concerning personalizing the 
second classifier further comprises information related to a time period associated 
with a subset of adapting data, for which the relevance of adapting data is altered. 

9. The system of claim 8, the information concerning retraining the second 
classifier further comprising information related to a point in time back to which 
modifications to the second classifier are to be rescinded. 

1 0. The system of claim 1 , where the first classifier and the second classifier 
are implemented in one component. 

1 1 . The system of claim 1 , where the first classifier and the second classifier 
are implemented in separate components. 

12. The system of claim 1 , where the training data employed in training the 
first classifier includes at least one of a header structure, an indicia of junk mail, a 
percentage of non-alphanumeric characters, capitalization patterns, relationships 
in an organization chart, length of messages, times of messages, dates of 
messages, tense usage, presence of questions and number of questions. 
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13. The system of claim 1 , where the adapting data includes at least one of an 
explicit data set and an observation data set. 

14. The system of claim 13, where the explicit data set comprises one or more 
pre-determined messages to be classified by a user of the system, where the user 
classifications of the pre-determined messages are employed in adapting the 
second classifier. 

1 5. The system of claim 1 3, where the observation data set comprises one or 
more messages received by a user of the system. 

1 6. The system of claim 1 5, the observation data set further comprising action 
data associated with the one or more messages received by a user of the system. 

17. The system of claim 1 , where the first classifier employs at least one of a 
support vector methodology, a naive Bayesian processing methodology, a 
sophisticated Bayesian processing methodology, a similarity analysis employing 
dot product and/or cosine function processing and decision tree processing to 
produce the first measure. 

1 8. The system of claim 1 , where the second classifier employs at least one of 
a support vector methodology, a naive Bayesian processing methodology, a 
sophisticated Bayesian processing methodology, a similarity analysis employing 
dot product and/or cosine function processing and decision tree processing to 
produce the second measure. 

19. The system of claim 1 , where the first measure is associated with at least 
one of a probability that the message has a known classification type, the priority 
of the message and the urgency score of the message. 
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20. The system of claim 1 , where the second measure is associated with at 
least one of the probability that a message has a known classification type, the 
priority of a message and the urgency score of a message. 

21. The system of claim 1 , where the third measure is associated with at least 
one of the probability that a message has a known classification type, the priority 
of a message and the urgency score of a message. 

22. The system of claim 2, where the third measure is computed using the 
formula 

F = ml(l - w) + m2(w), where ml is the first measure, where m2 is the 
second measure, where w is the weight assigned to the second measure and where 
(1-w) is the weight assigned to the first measure. 

23. The system of claim 22 where the third measure is normalized to a range 
associated with the range of the first measure and the second measure. 

24. The system of claim 1 where the third measure is employed to determine 
how an email message should be routed. 

25. The system of claim 1 where the third measure is employed to determine 
at least one of, when an email message should be routed, when an email message 
should be sent to a cell phone, when an email message should be archived, when 
an email message should be encrypted and when an email message should be 
deleted. 

26. The system of claim 2, where the weighting component determines the 
first weight and/or the second weight based, at least in part, on the amount of 
adapting data that has been employed in personalizing the second classifier. 
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27. The system of claim 2, where the weighting component determines the 
first weight and/or the second weight based, at least in part, on the coverage of the 
adapting data that has been employed in personalizing the second classifier. 

28. The system of claim 3, where the combining component produces the third 
measure by applying the first weight to the first measure to produce a first 
weighted measure and applying the second weight to the second measure to 
produce a second weighted measure and by combining the first weighted measure 
and the second weighted measure. 

29. The system of claim 3, where the aging component modifies the relevance 
of one or more messages by manipulating at least one of, a weight associated with 
a message and a weight associated with one or more pieces of message data. 

30. The system of claim 1 , further comprising a first data store operable to 
store at least one of, one or more messages employed in training the first 
classifying component and one or more pieces of message data employed in 
training the first classifying component. 

3 1 . The system of claim 30, further comprising a second data store operable to 
store at least one of, one or more messages employed in personalizing the second 
classifier and one or more pieces of message data employed in personalizing the 
second classifier. 

32. The system of claim 30, where the aging component modifies the 
relevance of one or more messages by removing at least one of, the one or more 
messages and the one or more pieces of message data from the first data store. 

33. The system of claim 3 1 , where the aging component modifies the 
relevance of one or more messages by removing at least one of, the one or more 
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messages from the second data store and one or more pieces of message data from 
the second data store. 

34. The system of claim 3, where the adapting component modifies the second 
classifier by adjusting at least one of, one or more data stores, one or more data 
structures, one or more algorithms and one or more rules associated with the 
second classifier. 

35. The system of claim 34, where the adapting component modifies the 
second classifier based, at least in part, one a relationship between the first 
measure and the second measure. 

36. A data packet adapted to be transmitted between two or more computer 
processes comprising: 

information related to personalizing an information classifier, the 
information comprising at least one of weighting data, aging data and adapting 
data. 

37. A computer readable medium containing computer executable 
components of a system for personalizing an information classifier, comprising: 

a first classifying component, pre-trained with training data, operable to 
produce a first measure associated with a message classification; 

a second classifying component, trained with adapting data, operable to 
produce a second measure associated with the message classification; 

a weighting component adapted to assign a first weight to the first measure 
and a second weight to the second measure; 

a combining component adapted to combine the first measure and the 
second measure to produce a third measure associated with the message 
classification, the combining component basing the combination, at least in part, 
on the first measure, the second measure, the first weight and the second weight; 
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an aging component adapted to modify the relevance of one or more 
messages based and/or one or more pieces of message data, at least in part, on 
time-based information associated with a message; and 

an adapting component operable to modify the second classifier. 

38. A method for personalizing an information classifying process, 

comprising: 

receiving a message to be classified; 

producing a first measure that the message is classified as having one of N 
characteristics, N being an integer; 

producing a second measure that the message is classified as having one of 
N characteristics, N being an integer; 

combining the first measure with the second measure to produce a third 
measure that the message is classified as having one of N characteristics, N being 
an integer, where the combining depends, at least in part, on a first weight 
associated with the first measure and a second weight associated with the second 
measure; and 

updating at least one of a data store, a data structure, an algorithm, a 
process, a thread and a rule employed in generating the second measure, based, at 
least in part, on a relationship between the first measure and the second measure. 

39. The method of claim 38, comprising displaying information associated 
with personalizing the information classifying process. 

40. The method of claim 39, where displaying information associated with 
personalizing the information classifying process includes displaying at least one 
of a measure associated with the degree of personalization associated with 
generating the second measure and one or more time periods over which the 
personalizing has occurred. 
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41 . The method of claim 38, further comprising accepting information 
associated with personalizing the information classifying process. 

42. The method of claim 41 , where the information associated with 
personalizing the information classifying process includes at least one of a size 
associated with an adapting data set employed in personalizing the classifying 
process, one or more time periods for which the weights assigned to 
personalization applied to the process employed in generating the second measure 
is to be manipulated and a point in time to which the process for generating the 
second measure should be reset. 

43. The method of claim 38, further comprising: 

storing at least one of one or more classified messages and one or more 
message data points; and 

manipulating the relevance of at least one of the one or more classified 
messages and the one or more message data points as related to determining at 
least one of the first and second probabilities based, at least in part, on temporal 
data associated with the one or more stored classified messages and one or more 
message data points. 

44. The method of claim 43, where manipulating the relevance of at least one 
of the one or more classified messages and one or more message data points 
includes deleting at least one of the one or more classified messages and the one 
or more message data points. 

45. The method of claim 43, where manipulating the relevance of at least one 
of the one or more classified messages and one or more message data points 
involves changing one or more weights associated with at least one of the one or 
more classified messages and the one or more message data points. 
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46. The method of claim 38 where the N characteristics comprise at least one 
of the probability that a message is of a known type, a message priority, an 
urgency score and a computed expected urgency. 

47. A computer readable medium having computer executable instructions for 
performing the method recited in claim 38. 

48. The computer readable medium of claim 47 further including computer 
executable instructions operable to display information associated with 
personalizing the information classifying process. 

49. A method for personalizing an information classifying process, 
comprising: 

producing a measure associated with a message classification; 

updating at least one of a data store, a data structure, an algorithm, a 
process, a thread and a rule employed in generating the measure, based, at least in 
part, on a relationship between the measure and a user input provided during 
supervised learning. 

50. A system for personalizing an information classifier, comprising: 
means for producing a first measure associated with a message 

classification, the first measure being associated with at least one of a probability 
that the message has a known classification type, the priority of the message and 
the urgency score of the message; 

means for producing a second measure associated with the message 
classification, the second measure being associated with at least one of the 
probability that the message has a known classification type, the priority of the 
message and the urgency score of the message; and 

means for combining the first measure and the second measure to produce 
a third measure associated with the message classification, the third measure 
being produced using the formula F = ml (1 - w) + m2(w), where ml is the first 
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measure, where m2 is the second measure, where w is the weight assigned to the 
second measure and where (1-w) is the weight assigned to the first measure. 
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